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Proprietary vs. Open-Source Al Models 


Free to use but requires 


Cost Expensive for heavy usage resources for training 


Transparency Limited, black-box models Full transparency of models 


Fully customizable and 
trainable 


Customization Limited to API fine-tuning 


Professional, enterprise- 
grade 


Examples GPT-4, Bard, Claude GPT-J, BLOOM, LLaMA 


Support Community-driven support 
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Large Language Models (LLMs) 
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“Once GPT “In a quaint village 
upon a time nestled between rolling hills 
and meandering streams, 
there existed [...]. " 


N-token Prompt M-token Completion 


Natural 
language 
input 


How language models work 


Pre- 
processing 


Many words map to one token, but some don't: indivisible. 


Unicode characters like emojis may be split into many tokens containing 
the underlying bytes. 


Sequences of characters commonly found next to each other may be grouped 
together: 1234567890 


TEXT 


- 3 to 4 ripe bananas, mashed 
- 1/3 cup meltec 

- 1cup sugar 

-legg, beaten mashed = 
-lteaspoonvar medium = 


Get results 


-1teaspoonbal Вап = 1.22% 
- Pinch of salt 

- 11/2 cups of al 

- Optional: 1/2 c 


Probability distribution 


Natural 
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How language models work 
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How language models work 
Tokens 


Tokens Characters 
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We need to stop anthropomorphizing ChatGPT. 


https://platform.openai.com/ 
tokenizer 


How language models work 
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Adding knowledge to the prompt 


Meta Prompt 


## Task 

You are an Al agent for the Contoso Trek outdoor products retailer. 
As the agent, you answer questions briefly, succinctly, and in a 
personable manner using markdown and even add some personal 
flair with appropriate emojis. 


## Documents 
The following documentation should be used in the response. The 
response should specifically include the product id. 


TrailWalker hiking Shoes 

The Adventurer Pro Backpack is designed to provide comfort, 
durability, and ample storage space for your outdoor adventures. 
Familiarize yourself with the key features of the backpack [...] 


TrekHiker Walking Booths 

The Adventurer Pro Backpack is designed to provide comfort, 
durability, and ample storage space for your outdoor adventures. 
Familiarize yourself with the key features of the backpack [...] 
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Response 
| 


Sure, I'd be 
happy to help! 
© Based on 
the available 
documentatio 
n, | can 
recommend 
two choices 
from the 


Contoso Trek 
catalog. 


1.Product: 
TrailWalker 
Hiking 
Shoes ID: 
36244753 Bra 
nd: IrekReady 
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Open-Source Al Models and Contributors 
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What is Hugging Face 


Hugging Face itself is not an AI model, it is a platform that 
hosts models from the broader Al community. 

It provides easy access to models such as GPT-Neo, BERT, о 
BLOOM, etc. 


Think of Hugging Face as a central hub where models live, 
rather than being a standalone Al model. 


. ЧӨ what: is Hugging a 


Hugging Face Key Contributions 


Model Hub: 


Hugging Face provides a repository of over 100,000 pre- 
trained models that can be used in tasks ranging from NLP 
(e.g., GPT-Neo, BERT) to computer vision and more. о 


Easy-to-Use Libraries: 


Libraries such as transformers, datasets, and tokenizers 
simplify the training, fine-tuning, and deployment of Al mode 


e ЧӨ us is Hugging US 


Collaboration: 


Hugging Face has collaborated with major Al research groups 
like BigScience (creators of BLOOM) and EleutherAl to host 
and distribute open-source models. 


Inference API: 


It also provides APIs to deploy models in production without 
needing to worry about infrastructure, making Al more 
accessible. 


е о 
Demo Time 


-Create your own GPT-Like Model and make it opensource 


e  lhanks! 


Do you have any questions? „ц. 
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